Probing Lexical Ambiguity: Word Vectors Encode Number and Relatedness of Senses

نویسندگان

چکیده

Lexical ambiguity—the phenomenon of a single word having multiple, distinguishable senses—is pervasive in language. Both the degree ambiguity (roughly, its number senses) and relatedness those senses have been found to widespread effects on language acquisition processing. Recently, distributional approaches semantics, which word's meaning is determined by contexts, led successful research quantifying ambiguity, but these measures not distinguished between words with multiple related versus unrelated meanings. In this work, we present first assessment whether representations can capture structure word, including both senses. On very large sample English words, find that some, all, semantic test exhibit detectable differences sets monosemes (unambiguous words; N = 964), polysemes (with senses; 4,096), homonyms 355). Our findings begin answer open questions from earlier work regarding successfully various relationships, also reflect fine-grained aspects influence human behavior. emphasize importance measuring proposed lexical such distinctions: addition standard benchmarks similarity models, need consider they cognitively plausible structure.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using WordNet Lexical Database and Internet to Disambiguate Word Senses

The term “knowledge acquisition bottleneck” has been used in Word Sense Disambiguation Tasks (WSDTs) to illustrate/express the problem of the lack of large tagged corpora. In this paper, an automated WSDT is based on text corpora extracted / collected from Internet web pages. First, the disambiguation for the sense of a word, in a context, is based on the use of its definition and the definitio...

متن کامل

Limits of Lexical Semantic Relatedness with Ontology-based Conceptual Vectors

Conceptual vectors can be used to represent thematic aspects of text segments, which allow for the computation of semantic relatedness. We study the behavior of conceptual vectors based on an ontology by comparing the results to the Miller-Charles benchmark. We discuss the limits to such an approach due to explicit mapping, as well as the viability of the Miller-Charles dataset as a benchmark f...

متن کامل

Discovering word senses from a network of lexical cooccurrences

Lexico-semantic networks such as WordNet have been criticized about the nature of the senses they distinguish as well as on the way they define these senses. In this article, we present a possible solution to overcome these limits by defining the sense of words from the way they are used. More precisely, we propose to differentiate the senses of a word from a network of lexical cooccurrences bu...

متن کامل

Specialising Word Vectors for Lexical Entailment

We present LEAR (Lexical Entailment Attract-Repel), a novel post-processing method that transforms any input word vector space to emphasise the asymmetric relation of lexical entailment (LE), also known as the IS-A or hyponymy-hypernymy relation. By injecting external linguistic constraints (e.g., WordNet links) into the initial vector space, the LE specialisation procedure brings true hyponymy...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Cognitive Science

سال: 2021

ISSN: ['0364-0213', '1551-6709']

DOI: https://doi.org/10.1111/cogs.12943